Seamless navigation in audio files

نویسنده

  • Christian Wellekens
چکیده

New audio services require editing tools for audio files. Indexing is a solution for fast access to specific information which could be speaker identity, location of speaker intervention on the file, topic identification. Good editing tools for text files have been available for many years and a solution for seamless navigation in an audio file could be the recognition of the content of the file to be edited (speech to text) but this requires in general, large vocabulary speaker independent recognizers giving acceptable results only for cooperative speakers restricting their speech to a domain for which a language model can be learned. Also even in that case, detection of musical chunks, intervention of a given speaker and segmentation in speakers remain interesting challenges. Mastering the complete indexing techniques will open the market for appealing consumer applications producing audio (but also video) programs on demand. Clearly access to multimedia databasesand multimedia archives will be easier.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time Mosaics - an Image Processing Approach to Audio Visualization

This paper presents a new approach to the visualization of monophonic audio files that simultaneously illustrates general audio properties and the component sounds that comprise a given input file. This approach represents sound clip sequences using archetypal images which are subjected to image processing filters driven by audio characteristics such as power, pitch and signalto-noise ratio. Wh...

متن کامل

Seamless Navigation Using Various Sensors: an Overview of the Seamless Navigation Campaign

Seamless positioning techniques in indoor and outdoor environments are necessary for obtaining sensor locations. However, no definitive indoor-outdoor navigation system simultaneously provides high accuracy, high availability and low installation cost. Furthermore, crowded indoor-outdoor navigation systems consisting of multiple techniques will destructively interfere with each other, but an ex...

متن کامل

Sound Collage Creation on a Curved Touch Display

We present a novel audio workspace for creating sound collages based on a vertically curved display. In contrast to flat interactive surfaces, this form factor avoids ergonomic problems of tabletop displays or vertical touch screens and enables continuous touch interaction across vertical and horizontal display parts. Additionally, it allows combining established software and hardware component...

متن کامل

[The power of speech].

fects. In effect, the mission would be a topographic imager that would yield a water map of volumetric gain or loss after each overpass (14). Such a satellite mission would enable hydrologists to move beyond the point-based gauging methods of the past century to measurements of the spatial variability inherent in surface water hydrology. Global coverage would ensure that, despite local economic...

متن کامل

Emergency Related Video Streaming in VANETs using Network Coding

Vehicular communications are becoming a reality driven by various applications. Among those applications safe navigation support is of most significance. In designing such navigation safety applications, reliable dissemination of data, i.e., every affected vehicle receives data, is the key issue. Past research focused on the reliable dissemination problem of plain media type (e.g. text) safety ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001